Filter Bank Design for Subband Adaptive Beamforming and Application to Speech Recognition
نویسندگان
چکیده
e present a new filter bank design method for subband adaptive beamforming. Filter bank design for adaptive filtering poses many problems not encountered in more traditional applications such as subband coding of speech or music. The popular class of perfect reconstruction filter banks is not well-suited for applications involving adaptive filtering because perfect reconstruction is achieved through alias cancellation, which functions correctly only if the outputs of individual subbands are not subject to arbitrary magnitude scaling and phase shifts. In this work, we design analysis and synthesis prototypes for modulated filter banks so as to minimize each aliasing term individually. We then show that the total response error can be driven to zero by constraining the analysis and synthesis prototypes to be Nyquist(M) filters. We show that the proposed filter banks are more robust for aliasing caused by adaptive beamforming than conventional methods. Furthermore, we demonstrate the effectiveness of our design technique through a set of automatic speech recognition experiments on the multi-channel, farfield speech data from the PASCAL Speech Separation Challenge. In our system, speech signals are first transformed into the subband domain with the proposed filter banks, and thereafter the subband components are processed with a beamforming algorithm. Following beamforming, post-filtering and binary masking are performed to further enhance the speech by removing residual noise and undesired speech. The experimental results prove that our beamforming system with the proposed filter banks achieves the best recognition performance, a 39.6% word error rate (WER), with half the amount of computation of that of the conventional filter banks while the perfect reconstruction filter banks provided a 44.4% WER.
منابع مشابه
Spatial - Temporal Subband Beamforming for Near Field Adaptive Array Processing
This thesis investigates broadband adaptive beamforming for signal targets located in the near field of an array. The primary application of this research is hands-free sound pickup and speech enhancement for wideband telephony. The technical challenges are three-fold. Broadband beamformers are difficult to design due to large frequency dependent beampattern variations and reduced performances ...
متن کاملFeature Extracting in the Presence of Environmental Noise, using Subband Adaptive Filtering
In this work, a new feature extracting method in noisy environments is proposed. The approach is based on subband decomposition of speech signals followed by adaptive filtering in the noisiest subbbands of speech. The speech decomposition is obtained using low complexity octave filter bank, while adaptive filtering is performed using the normalized least mean square algorithm. The performance o...
متن کاملTo Separate Speech! A System for Recognizing Simultaneous Speech
The PASCAL Speech Separation Challenge (SSC) is based on a corpus of sentences from the Wall Street Journal task read by two speakers simultaneously and captured with two circular eight-channel microphone arrays. This work describes our system for the recognition of such simultaneous speech. Our system has four principal components: A person tracker returns the locations of both active speakers...
متن کاملUniform and warped low delay filter-banks for speech enhancement
A versatile filter-bank concept for adaptive subband filtering is proposed, which achieves a significantly lower algorithmic signal delay than commonly used analysissynthesis filter-banks. It is derived as an efficient implementation of the filter-bank summation method and performs time-domain filtering with coefficients adapted in the uniform or non-uniform frequency-domain. The frequency warp...
متن کاملOptimal and Adaptive Subband Beamforming Principles and Applications
This part discusses signal processing methods for speech extraction in use with voice communication applications such as personal digital assistants (PDA:s), mobile telephone terminals and personal computers. The speaker will be distant from the device and thus the speech signal entering the device will be subject to reverberation as well as disturbed by background noise. Further more, the comp...
متن کامل